Disambiguation in determining phonemes of sound-imitation words for environmental sound recognition
نویسندگان
چکیده
Onomatopoeia, or sound-imitation words (SIWs) are important in informing sound events in human-computer communication. One problem is listener-dependency in recognizing environmental sounds by means of SIWs, that is, different listener hears the same environmental sound as a different SIW even under the same condition. Therefore, the use of usual Japanese phonemes is not adequate to express SIWs. To cope with this ambiguity problem of phoneme determination, we designed a set of new phonemes, referred to as the basic phoneme-groups, to represent environmental sounds. The basic phonemegroup consists of one or more Japanese phonemes, and thus the ambiguity problem is resolved based on it by generating one or more SIWs for a sound event. An HMM-based scheme is adopted to recognize SIWs using the phoneme-groups. Listening experiments with seven subjects showed that automatic SIW recognition based on the basic phoneme-groups outperformed ones based on the other types of phonemes. The recall and precision rate were 56.4% and 72.2%, respectively.
منابع مشابه
Sound-Imitation Word Recognition for Environmental Sounds Disambiguation in Determining Phonemes of Sound-Imitation Words
Environmental sounds are very helpful in understanding environmental situations and in telling the approach of danger, and sound-imitation words (sound-related onomatopoeia) are important expressions to inform such sounds in human communication, especially in Japanese language. In this paper, we design a method to recognize sound-imitation words (SIWs) for environmental sounds. Critical issues ...
متن کاملAutomatic transformation of environmental sounds into sound-imitation words based on Japanese syllable structure
Sound-imitation words, a sound-related subset of onomatopoeia, are important for computer-human interaction and automatic tagging of sound archives. The main problem of automatic recognition of sound-imitation word is that the literal representation of such words is dependent on listeners and influenced by a particular cultural history. Based on our preliminary experiments of such dependency an...
متن کاملEffects of sound pillow in the treatment of stuttering and cognitive phonemes impairment in children
Introduction:Verbal language is Fundamental component for expressing ideas, social interaction and understanding educational materials. Effective communications require verbal language skills. Sound pillows may partly address the children with behavior problems. The purpose of this study was assessing the effect of educational sound pillow in the treatment of stuttering and cognitive phonemes i...
متن کاملPersian Cued Speech: The Effect on the Perception of Persian Language Phonemes and Monosyllabic Words with and without Sound in Hearing Impaired Children
Objectives: This paper studies the effect of Persian Cued Speech on the perception of Persian language phonemes and monosyllabic words with and without sound in hearing impaired children. Cued Speech is a sound based mode of communication for hearing impaired people that is comprised of a limited series of hand complements and the normal pattern of speech. And it is shown that it effectively ca...
متن کاملOptimal event search using a structural cost function - improvement of structure to speech conversion
This paper describes a new and improved method for the framework of structure to speech conversion we previously proposed. Most of the speech synthesizers take a phoneme sequence as input and generate speech by converting each of the phonemes into its corresponding sound. In other words, they simulate a human process of reading text out. However, infants usually acquire speech communication abi...
متن کامل